Abstract

Many experimental studies have shown that the prion AGAAAAGA palindrome hydrophobic region (113-120) has amyloid fibril forming properties and plays an important role in prion diseases. However, due to the unstable, noncrystalline and insoluble nature of the amyloid fibril, structural information on the AGAAAAGA region (113-120) has to date been very limited. This region falls just within the N-terminal unstructured region PrP (1-123) of prion proteins. Traditional X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy cannot be used to obtain its structural information. Against this background, this paper introduces a novel approach based on the canonical dual theory to address the 3D atomic-resolution structure of prion AGAAAAGA amyloid fibrils. The canonical dual computational approach introduced in this paper is intended for the molecular modeling of prion AGAAAAGA amyloid fibrils, and the optimal atomic-resolution structures of prion AGAAAAGA amyloid fibrils presented here are useful for the drive to find treatments for prion diseases in the field of medicinal chemistry. Overall, this paper presents an important method and provides useful information for treatments of prion diseases, and it could be of interest to the general readership of Journal of Theoretical Biology.

Highlights

► Study of prion AGAAAAGA amyloid fibril molecular structures.
► Sum of van der Waals radii regarding the minimization point of the Lennard-Jones potential energy.
► Mathematical model formulated as a global optimization molecular distance geometry problem.
► Use of a novel canonical dual computational approach to solve the model.
► Use of the computational chemistry Amber package to refine the model.

Key words: Mathematical Canonical Duality Optimization Theory, Two-body Theoretical Physics, Structural Bioinformatics Technology, Sensor Network Optimization Problem, Amyloid Fibril, Prion AGAAAAGA.

1 Introduction 

According to a recent comprehensive review (Chou, 2011), developing a useful model for biological systems usually requires attention to the following: (i) the benchmark material used to develop and test the model, (ii) the formulation of the modeling method, (iii) the operating procedures during the modeling process, (iv) properly performed cross-validation tests to objectively evaluate the anticipated accuracy of the model, and (v) the establishment of a web-server. Below, we elaborate on some of these procedures. In this paper, the material used to develop the model is 3NHC.pdb and its 3D crystal structures; the modeling method consists of the mathematical optimization methods of the canonical dual theory (CDT) (Gao, 2000, Gao et al., 2010, Gao et al., 2012) (procedure 1) and of the Amber 11 package's Steepest Descent (SD) method (Case et al., 2010) and Conjugate Gradient (CG) method (Case et al., 2010, Sun et al., 2001) (procedure 2); and the accuracy of the model is tested by the RMSD (root-mean-square deviation) value between the last snapshots of procedures 1 and 2.

Various computational molecular dynamics approaches have been used to study PrP (106-126) (Kuwata et al., 2003, Wagoner 2010) but, to the best of our knowledge, few computational approaches have been used to predict the molecular structures of prion AGAAAAGA amyloid fibrils (Zhang, 2011, Zhang et al., 2011). Zhang (2011) constructed three AGAAAAGA amyloid fibril models by the standard simulated annealing (SA) method and several traditional optimization methods within the AMBER 10 package. In (Zhang et al., 2011), the hybrid simulated annealing discrete gradient (SADG) method was used to build two AGAAAAGA amyloid fibril models (instead of the Insight II (http://accelrys.com) package used in (Zhang, 2011)), and the models were then refined/optimized by the SD and CG (SDCG) methods, the SA method and the SDCG methods again. In this paper, all the optimization approaches of (Zhang, 2011, Zhang et al., 2011) are replaced by the optimization theory of CDT. Numerical results show that the CDT optimization approach performs very well; no further SDCG refinement by the AMBER package is even needed. We could not compare the models of (Zhang, 2011, Zhang et al., 2011) with those of this paper (for example, in terms of the distances in angstroms between adjacent β-sheets and β-strands), because these models have different numbers of chains and belong to different structural Classes listed in (Kuwata et al., 2003).

As is well known, the disease prions PrP^Sc are rich in β-sheet amyloid fibrils (about 43% β-sheet) (Griffith, 1967). There are some classical works on β-sheets and β-barrels (Chou et al., 1983a, 1990a, b, 1991). X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy are two powerful tools for determining protein 3D structure. However, not all proteins can be successfully crystallized, particularly membrane proteins. Although NMR is indeed a very powerful tool for determining the 3D structures of membrane proteins (see, e.g., (Schnell et al., 2008, Oxenoid et al., 2005, Call et al., 2010, Pielak et al., 2010, Pielak et al., 2009, Wang et al., 2009) and a recent review (Pielak et al., 2011)), it is also time-consuming and costly. To acquire structural information in a timely manner, one has to resort to various structural bioinformatics tools (see, e.g., (Chou, 2004b, Chou, 2004c, Chou, 2004d, Chou, 2005b) and a comprehensive review (Chou, 2004c)). In particular, computational approaches allow us to obtain a description of the protein 3D structure at a submicroscopic level. Under many circumstances, due to the unstable, noncrystalline and insoluble nature of amyloid fibrils, it is very difficult to use traditional X-ray and NMR experimental methods to obtain their atomic-resolution structures (Tsai, 2005, Zheng et al., 2006). Although X-ray and NMR techniques cannot determine, in a timely manner, the 3D structures of some proteins and their binding interactions with ligands that are important for drug design and basic research, many structural bioinformatics tools can play a complementary role in this regard, as demonstrated by a series of papers published recently (see, e.g., (Cai et al., 2011, Chou, 2004a, Chou, 2005a, Chou et al., 2003, Du et al., 2007, Du et al., 2010, Gong et al., 2009, Liao et al., 2011, Wang et al., 2010, Wang et al., 2009, Wei et al., 2009)). This paper, in some sense, presents a structural bioinformatics tool based on the CDT mathematical optimization theory.

The accuracy of the models presented in this paper is tested by the RMSD value. The last snapshot of procedure 2 is superposed onto the last snapshot of procedure 1, and the RMSD value is zero after alignment by VMD 1.8.7beta5 (Humphrey et al., 1996). This implies that the CDT strategy can accurately build the prion AGAAAAGA amyloid fibril models. To test the accuracy of a model, some validation methods are always used. In developing a prediction model or algorithm, the following three cross-validation methods are often used to examine its effectiveness in practical application: the independent dataset test, the subsampling (5-fold or 10-fold cross-validation) test, and the jackknife test (Chou et al., 1995). However, as demonstrated by Eqs. 28-32 of (Chou, 2011), among the three cross-validation methods, the jackknife test is deemed the least arbitrary, since it always yields a unique result for a given benchmark dataset, and hence has been increasingly used and widely recognized by investigators to examine the accuracy of various models and predictors (see, e.g., (Chen et al., 2009, Chou et al., 2011, Ding et al., 2009, Hayat et al., 2011, Kandaswamy et al., 2011, Lin et al., 2011, Mohabatkar, 2010, Zeng et al., 2009, Zhou et al., 2007); all these papers reflect the current trend of increasingly and widely using the jackknife test to examine varieties of models or predictors).

There is another criterion for evaluating the models. To avoid homology bias and remove redundant sequences from the benchmark dataset, a cutoff threshold of 25% was recommended (Chou, 2011, Chou et al., 2011) to exclude from the benchmark datasets those proteins that have 25% or greater sequence identity to any other. However, in this study we did not use such a stringent criterion because the currently available data do not allow us to do so; otherwise, the number of proteins left would be too small to have statistical significance.

The last procedure in developing a useful model for biological systems is the establishment of a web-server. Since user-friendly and publicly accessible web-servers represent the future direction for developing practically more useful models, simulated methods, or predictors (Chou et al., 2009), we shall make efforts in our future work to provide a web-server for the method presented in this paper.

This paper addresses an important problem in neurodegenerative amyloid fibril or plaque diseases. The rest of this paper is arranged as follows. In Section 2, the CDT is briefly introduced and its effectiveness is illustrated by applying the CDT-based optimization approach to the well-known problem of minimizing the Double Well Potential function. In Section 3, the molecular modeling of prion AGAAAAGA amyloid fibrils is carried out: the optimal prion AGAAAAGA amyloid fibril models are obtained by applying the CDT-based optimization theory, and these models are further refined/optimized by the SDCG methods of the Amber 11 package. The zero RMSD value implies that the CDT optimization strategy can accurately obtain the prion AGAAAAGA amyloid fibril models. Thus, when the time-consuming and costly X-ray crystallography or NMR spectroscopy still cannot determine the protein 3D structure, we may introduce computational approaches or novel mathematical formulations and physical concepts into molecular biology to study molecular structures. This concluding remark is made in the last section, Section 4.



2 The Canonical Dual Approach 

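We briefly introduce the CDT of (Gao et al., 2010, Gao et al., 2012, Gao, 2000), specialized to the following minimization problem of a sum of fourth-order polynomials:

$$(\mathcal{P}):\quad \min\Big\{ P(x) = \sum_{i=1}^{m} W_i(x) + \tfrac{1}{2}x^T Q x - x^T f \;:\; x \in \mathcal{X} \Big\}, \tag{1}$$

where $W_i(x) = \tfrac{1}{2}\alpha_i\big(\tfrac{1}{2}x^T A_i x + b_i^T x + c_i\big)^2$, $A_i = A_i^T$, $Q = Q^T \in \mathbb{R}^{n\times n}$, $b_i, f \in \mathbb{R}^n$, $c_i, \alpha_i \in \mathbb{R}^1$, $i = 1, 2, \ldots, m$, $x \in \mathcal{X} \subset \mathbb{R}^n$. The dual problem of $(\mathcal{P})$ is

$$(\mathcal{P}^d):\quad \max\Big\{ P^d(\varsigma) = \sum_{i=1}^{m}\big(c_i\varsigma_i - \tfrac{1}{2}\alpha_i^{-1}\varsigma_i^2\big) - \tfrac{1}{2}F^T(\varsigma)\,G^{+}(\varsigma)\,F(\varsigma) \;:\; \varsigma \in \mathcal{S}_a \Big\}, \tag{2}$$

where $F(\varsigma) = f - \sum_{i=1}^{m}\varsigma_i b_i$, $G(\varsigma) = Q + \sum_{i=1}^{m}\varsigma_i A_i$, $\mathcal{S}_a = \{\varsigma \in \mathbb{R}^m \mid F(\varsigma) \in \mathrm{Col}(G(\varsigma))\}$, $G^{+}$ denotes the Moore-Penrose generalized inverse of $G$, and $\mathrm{Col}(G(\varsigma))$ is the column space of $G(\varsigma)$. The primal-dual Gao-Strang complementary function of CDT (Gao et al., 2010, Gao et al., 2012, Gao, 2000) is

$$\Xi(x,\varsigma) = \sum_{i=1}^{m}\Big[\big(\tfrac{1}{2}x^T A_i x + b_i^T x + c_i\big)\varsigma_i - \tfrac{1}{2}\alpha_i^{-1}\varsigma_i^2\Big] + \tfrac{1}{2}x^T Q x - x^T f. \tag{3}$$

For $(\mathcal{P})$ and $(\mathcal{P}^d)$ we have the following CDT: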



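Theorem 1 (Gao et al., 2010, Gao et al., 2012, Gao, 2000) The problem $(\mathcal{P}^d)$ is canonically dual to $(\mathcal{P})$ in the sense that if $\bar\varsigma$ is a critical point of $P^d(\varsigma)$, then $\bar x = G^{+}(\bar\varsigma)F(\bar\varsigma)$ is a critical point of $P(x)$ on $\mathbb{R}^n$, and $P(\bar x) = P^d(\bar\varsigma)$. Moreover, if $\bar\varsigma \in \mathcal{S}_a^{+} = \{\varsigma \in \mathcal{S}_a \mid G(\varsigma) \succ 0\}$, then $\bar\varsigma$ is a global maximizer of $P^d(\varsigma)$ over $\mathcal{S}_a^{+}$, $\bar x$ is a global minimizer of $P(x)$ on $\mathbb{R}^n$, and

$$P(\bar x) = \min_{x\in\mathbb{R}^n} P(x) = \Xi(\bar x,\bar\varsigma) = \max_{\varsigma\in\mathcal{S}_a^{+}} P^d(\varsigma) = P^d(\bar\varsigma). \tag{4}$$

It is easy to prove that the canonical dual function $P^d(\varsigma)$ is concave on the convex dual feasible space $\mathcal{S}_a^{+}$. Therefore, Theorem 1 shows that the nonconvex primal problem $(\mathcal{P})$ is equivalent to a concave maximization problem $(\mathcal{P}^d)$ over the convex space $\mathcal{S}_a^{+}$, which can be solved easily by well-developed methods. Over $\mathcal{S}_a^{-} = \{\varsigma \in \mathcal{S}_a \mid G(\varsigma) \prec 0\}$ we have the following theorem: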

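Theorem 2 (Gao et al., 2012) Suppose that $\bar\varsigma$ is a critical point of $(\mathcal{P}^d)$ and the vector $\bar x$ is defined by $\bar x = G^{+}(\bar\varsigma)F(\bar\varsigma)$. If $\bar\varsigma \in \mathcal{S}_a^{-}$, then on a neighborhood $\mathcal{X}_o \times \mathcal{S}_o \subset \mathcal{X} \times \mathcal{S}_a^{-}$ of $(\bar x,\bar\varsigma)$, we have either

$$P(\bar x) = \min_{x\in\mathcal{X}_o} P(x) = \Xi(\bar x,\bar\varsigma) = \min_{\varsigma\in\mathcal{S}_o} P^d(\varsigma) = P^d(\bar\varsigma), \tag{5}$$

or

$$P(\bar x) = \max_{x\in\mathcal{X}_o} P(x) = \Xi(\bar x,\bar\varsigma) = \max_{\varsigma\in\mathcal{S}_o} P^d(\varsigma) = P^d(\bar\varsigma). \tag{6}$$

By the fact that the canonical dual function is a d.c. function (difference of convex functions) on $\mathcal{S}_a^{-}$, the double-min duality (5) can be used for finding the biggest local minimizer of $(\mathcal{P})$ and $(\mathcal{P}^d)$, while the double-max duality (6) can be used for finding the biggest local maximizer of $(\mathcal{P})$ and $(\mathcal{P}^d)$. In physics and material sciences, this pair of biggest local extremal points plays an important role in phase transitions.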

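To illustrate that the CDT works, we minimize the well-known Double Well Potential (DWP) function (Gao, 2000) (blue in Fig. 1):

$$P(x) = \tfrac{1}{2}\big(\tfrac{1}{2}x^2 - 2\big)^2 - \tfrac{1}{2}x, \tag{7}$$

whose canonical dual is

$$P^d(\varsigma) = -\frac{1}{8\varsigma} - 2\varsigma - \tfrac{1}{2}\varsigma^2 \tag{8}$$

(red in Fig. 1), with $\mathcal{S}_a^{+} = \{\varsigma \in \mathbb{R} \mid \varsigma > 0\}$. Setting $\nabla\Xi(x,\varsigma) = 0$, we get three critical points of $\Xi(x,\varsigma)$: $(\bar x^1,\bar\varsigma^1) = (2.11491, 0.236417)$, $(\bar x^2,\bar\varsigma^2) = (-1.86081, -0.268701)$, $(\bar x^3,\bar\varsigma^3) = (-0.254102, -1.96772)$. By Theorem 1, $\bar x^1 = 2.11491$ is the global minimizer of (7), $\bar\varsigma^1 = 0.236417$ is the global maximizer of (8) over $\mathcal{S}_a^{+}$, and $P(\bar x^1) = \Xi(\bar x^1,\bar\varsigma^1) = P^d(\bar\varsigma^1) = -1.02951$. By Theorem 2, we have the local minimizers $\bar x^2 = -1.86081$, $\bar\varsigma^2 = -0.268701$ (over $\mathcal{S}_a^{-}$), with $P(\bar x^2) = \Xi(\bar x^2,\bar\varsigma^2) = P^d(\bar\varsigma^2) = 0.966503$, and the local maximizers $\bar x^3 = -0.254102$, $\bar\varsigma^3 = -1.96772$ (over $\mathcal{S}_a^{-}$), with $P(\bar x^3) = \Xi(\bar x^3,\bar\varsigma^3) = P^d(\bar\varsigma^3) = 2.063$.

Figure 1: The primal and dual double-well potential functions (primal: blue, dual: red).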

Thus, from Fig. 1, which illustrates the application of CDT to the DWP problem, we see that the canonical dual approach works. The power of the canonical dual approach is preliminarily shown in Tables 1-3 of arXiv:1105.2270v3. In the next section, we apply this canonical dual approach to the molecular model building and solving problem of prion AGAAAAGA amyloid fibrils.
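The DWP example can be checked numerically. The following is a minimal sketch, assuming the DWP function has the form P(x) = ½(½x² − 2)² − ½x with canonical dual P^d(ς) = −1/(8ς) − 2ς − ½ς²; this form is an assumption chosen because it reproduces the critical points and function values quoted above. The function names are illustrative only.

```python
import numpy as np

def primal(x):
    """Assumed double-well potential P(x) = 1/2 (1/2 x^2 - 2)^2 - 1/2 x."""
    return 0.5 * (0.5 * x**2 - 2.0) ** 2 - 0.5 * x

def dual(s):
    """Assumed canonical dual P^d(s) = -1/(8 s) - 2 s - 1/2 s^2, for s != 0."""
    return -1.0 / (8.0 * s) - 2.0 * s - 0.5 * s**2

def critical_points():
    """Solve P'(x) = x (x^2/2 - 2) - 1/2 = 0, i.e. x^3 - 4 x - 1 = 0."""
    roots = np.roots([1.0, 0.0, -4.0, -1.0])
    xs = np.sort(roots.real)[::-1]   # the cubic has three real roots
    ss = 0.5 * xs**2 - 2.0           # dual variable s = x^2/2 - 2 at each root
    return xs, ss

xs, ss = critical_points()
for x, s in zip(xs, ss):
    # At every critical point the primal and dual values coincide.
    print(f"x = {x: .6f}  s = {s: .6f}  P = {primal(x): .6f}  Pd = {dual(s): .6f}")
```

Only the root with ς > 0 lies in the region where Theorem 1 guarantees a global minimizer; the other two roots have ς < 0 and fall under Theorem 2.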

3 Prion AGAAAAGA Amyloid Fibril Molecular Model 
Building and Solving 

Many experimental studies, such as (Brown, 2000, Brown, 2001, Brown et al., 1994, Holscher et al., 1998, Jobling et al., 2001, Jobling et al., 1999, Kuwata et al., 2003, Norstrom et al., 2005, Wegner et al., 2002), have shown two points: (1) the hydrophobic region (113-120) AGAAAAGA of prion proteins is critical in the conversion from a soluble PrP^C into an insoluble PrP^Sc fibrillar form; and (2) normal AGAAAAGA is an inhibitor of prion diseases. Various computational approaches have been used to address problems related to "amyloid fibril" (Carter et al., 1998, Chou, 2004b, Chou, 2004c, Chou et al., 2002, Wang et al., 2008, Wei et al., 2005, Zhang, 2011, Zhang et al., 2011, Zhang, 2009). By introducing novel mathematical canonical dual formulations and computational approaches, in this paper we construct atomic-resolution molecular structures for prion (113-120) AGAAAAGA amyloid fibrils.



The atomic structures of all amyloid fibrils revealed steric zippers, with strong van der Waals interactions between β-sheets and hydrogen bonds to maintain the β-strands (Sawaya et al., 2007). For β-sheets and β-barrels, there are various interactions and motions, such as the interactions between β-strands (Chou et al., 1982b, Chou et al., 1982a, Chou et al., 1983a, Chou et al., 1983b), the interaction between two β-sheets (Chou et al., 1986), as well as the low-frequency accordion-like motion in a β-sheet and breathing motion in a β-barrel (Chou, 1985) and their biological functions (Chou, 1988). The "amyloid fibril" problem can be viewed as a molecular distance geometry problem (MDGP) (Grosso et al., 2009), which arises in the interpretation of NMR data and in the determination of protein structure. [As an example to understand the MDGP, the problem of locating sensors in telecommunication networks is a distance geometry problem (DGP). In such a case, the positions of some sensors (called anchors) are known, and some of the distances between sensors (which may or may not be anchors) are known; the DGP is to locate the positions of all the sensors. Here we regard sensors as atoms and their telecommunication network as a molecule.] The three-dimensional structure of a molecule with n atoms can be described by specifying the 3D coordinate positions $x_1, x_2, \ldots, x_n \in \mathbb{R}^3$ of all its atoms. Given bond lengths $d_{ij}$ between a subset S of the atom pairs, the determination of the molecular structure is

$$(\mathcal{P}_0)\quad \text{to find } x_1, x_2, \ldots, x_n \ \text{ s.t. } \ \|x_i - x_j\| = d_{ij},\ (i,j) \in S, \tag{9}$$

where $\|\cdot\|$ denotes a norm in a real vector space, taken to be the Euclidean 2-norm in this paper. (9) can be reformulated as a mathematical global optimization problem (GOP)

$$(\mathcal{P})\quad \min P(X) = \sum_{(i,j)\in S} w_{ij}\big(\|x_i - x_j\|^2 - d_{ij}^2\big)^2 \tag{10}$$

in terms of finding the global minimum of the function $P(X)$, where $w_{ij}$, $(i,j) \in S$, are positive weights, $X = (x_1, x_2, \ldots, x_n)^T \in \mathbb{R}^{n\times 3}$ (Moré et al., 1997), and usually S has many fewer than $n^2/2$ elements due to the error in the theoretical or experimental data (Zou et al., 1997, Grosso et al., 2009). There may not even exist any solution $x_1, x_2, \ldots, x_n$ satisfying the distance constraints in (9), for example when data for atoms $i, j, k \in S$ violate the triangle inequality; in this case, we may add a perturbation term $-\epsilon^T X$ to $P(X)$:

$$(\mathcal{P}_\epsilon)\quad \min P_\epsilon(X) = \sum_{(i,j)\in S} w_{ij}\big(\|x_i - x_j\|^2 - d_{ij}^2\big)^2 - \epsilon^T X, \tag{11}$$

where $\epsilon > 0$. In some cases, instead of exact values $d_{ij}$, $(i,j) \in S$, we can only specify lower and upper bounds on the distances: $l_{ij} \le \|x_i - x_j\| \le u_{ij}$, $(i,j) \in S$; in such cases we may penalize all the unsatisfied constraints in the objective function of $(\mathcal{P}_\epsilon)$ by adding

$$\sum_{(i,j)\in S}\Big[\big(\max\{l_{ij}^2 - \|x_i - x_j\|^2,\, 0\}\big)^2 + \big(\max\{\|x_i - x_j\|^2 - u_{ij}^2,\, 0\}\big)^2\Big]$$

to $P_\epsilon(X)$ (Zou et al., 1997, Grosso et al., 2009), where we may let $d_{ij}$ be the interatomic distance (less than 6 angstroms) for the pair in successive residues of a protein and set $l_{ij} = (1 - 0.05)d_{ij}$ and $u_{ij} = (1 + 0.05)d_{ij}$ (Grosso et al., 2009). In this paper we use the canonical duality approach introduced in Section 2 (Gao et al., 2010, Gao et al., 2012, Gao, 2000) to solve (9)-(11). Because the canonical dual is a perfect dual with zero duality gap between the primal and dual problems, we can get accurate global optimal solutions of problems (9)-(11). Thus, by the canonical dual approach, we may construct the molecular structure of prion AGAAAAGA amyloid fibrils as follows.
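The distance geometry objective and the bound penalty described above can be sketched in a few lines. This is a minimal illustration, not the implementation used in the paper; the function names `mdgp_objective` and `bound_penalty`, the argument layout, and the squared form of the penalty terms are assumptions made for the example.

```python
import numpy as np

def pair_sq_dists(X, pairs):
    """Squared Euclidean distances ||x_i - x_j||^2 for each (i, j) in pairs."""
    X = np.asarray(X, dtype=float)
    i, j = np.asarray(pairs).T
    return np.sum((X[i] - X[j]) ** 2, axis=1)

def mdgp_objective(X, pairs, d, w=None, eps=None):
    """sum_{(i,j)} w_ij (||x_i - x_j||^2 - d_ij^2)^2, minus eps^T X if eps is given."""
    d = np.asarray(d, dtype=float)
    w = np.ones(len(d)) if w is None else np.asarray(w, dtype=float)
    value = np.sum(w * (pair_sq_dists(X, pairs) - d**2) ** 2)
    if eps is not None:
        # Linear perturbation term; eps may be a scalar or an array shaped like X.
        value -= np.sum(np.asarray(eps, dtype=float) * np.asarray(X, dtype=float))
    return float(value)

def bound_penalty(X, pairs, lower, upper):
    """Penalty for violating l_ij <= ||x_i - x_j|| <= u_ij (squared-hinge form)."""
    sq = pair_sq_dists(X, pairs)
    lo = np.maximum(np.asarray(lower, dtype=float) ** 2 - sq, 0.0)
    hi = np.maximum(sq - np.asarray(upper, dtype=float) ** 2, 0.0)
    return float(np.sum(lo**2 + hi**2))

# Toy example: three atoms forming a right triangle with exact distances.
pairs = [(0, 1), (0, 2), (1, 2)]
d = np.array([1.0, 1.0, np.sqrt(2.0)])
X_exact = [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
X_moved = [[0.0, 0.0, 0.0], [2.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
print(mdgp_objective(X_exact, pairs, d), mdgp_objective(X_moved, pairs, d))
print(bound_penalty(X_moved, pairs, 0.95 * d, 1.05 * d))
```

The objective vanishes exactly when every listed distance is matched, which is why a global minimum value of zero corresponds to a solution of (9).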

If we view the prion AGAAAAGA molecular modeling problem as an MDGP with two anchors and two sensors, we can easily construct the prion AGAAAAGA amyloid fibril models. In fact, we could let the coordinates of these two anchors be variable. However, the two anchors belong to one body, Chains A and B, and the two sensors belong to another body, Chains G and H. This is a simple two-body problem model of theoretical physics, i.e. Einstein's absolute relative theory. Hence, we may regard the coordinates of the two anchors as fixed. The constructions are based on the most recently released experimental molecular structures of human M129 prion peptide 127-132 (PDB entry 3NHC, released into the Protein Data Bank (http://www.rcsb.org) on 04-AUG-2010); in brief, this paper uses the PrP structured region 127-132 to do homology modelling for the PrP unstructured region 113-120. The atomic-resolution structure of this peptide is a steric zipper, with strong van der Waals (vdw) interactions between β-sheets and hydrogen bonds to maintain the β-strands (Fig. 2, where the dashed lines denote the hydrogen bonds).

Figure 2: Protein fibril structure of human M129 prion GYMLGS (127-132).

In Fig. 2 we see that the G (H) chains (i.e. β-sheet 2) of 3NHC.pdb can be obtained from the A (B) chains (i.e. β-sheet 1) by
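$$G(H) = \begin{pmatrix} \cdots \\ 0 \;\; -1 \;\; \cdots \\ \cdots \end{pmatrix} A(B) + \begin{pmatrix} \cdots \\ 4.77650 \\ \cdots \end{pmatrix}, \tag{12}$$

and the other chains can be obtained by

$$I(J) = G(H) + \begin{pmatrix} \cdots \\ 9.5530 \\ \cdots \end{pmatrix}, \qquad K(L) = G(H) + \begin{pmatrix} \cdots \\ -9.5530 \\ \cdots \end{pmatrix}, \tag{13}$$

$$C(D) = A(B) + \begin{pmatrix} \cdots \\ 9.5530 \\ \cdots \end{pmatrix}, \qquad E(F) = A(B) + \begin{pmatrix} \cdots \\ -9.5530 \\ \cdots \end{pmatrix}. \tag{14}$$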



Based on the template 3NHC.pdb from the Protein Data Bank, three prion AGAAAAGA palindrome amyloid fibril models, an AGAAAA model (Model 1), a GAAAAG model (Model 2), and an AAAAGA model (Model 3), are constructed in this paper. The AB chains of Models 1-3 were obtained from the AB chains of 3NHC.pdb using the mutate module of the free package Swiss-PdbViewer (SPDBV Version 4.01) (http://spdbv.vital-it.ch). Almost all the hydrogen bonds are preserved after the mutations, where a hydrogen bond between the donor O (oxygen) atom and the acceptor H (hydrogen) atom is regarded as kept under a distance cutoff of 3.00 angstroms and an angle cutoff of 120.00 degrees; thus we need to consider the vdw contacts only. Mutating the GH chains of 3NHC.pdb gives the GH chains of Models 1-3. However, the vdw contacts between the A chain and the G chain, and between the B chain and the H chain, are too far apart at this stage (Figs. 3-5), because the shortest distance between atoms of Chain A and Chain G, and of Chain B and Chain H, is still much larger than twice the vdw radius of the CB carbon atom. From Figs. 3-5 we see that, for Models 1-3, at least the two vdw interactions A.ALA3.CB-G.ALA4.CB and B.ALA4.CB-H.ALA3.CB should be maintained. Fixing the coordinates of A.ALA3.CB and B.ALA4.CB (the two anchors) at (6.014, 5.917, 0.065) and (5.658, 1.630, -0.797), letting d equal twice the vdw radius of the carbon atom (i.e. d = 3.4 angstroms), and letting the coordinates of G.ALA4.CB and H.ALA3.CB (the two sensors) be variables, we get a simple MDGP with 6 variables and its dual with 2 variables:

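$$P(x_1,x_2) = \tfrac{1}{2}\big\{(x_{11} - 6.014)^2 + (x_{12} - 5.917)^2 + (x_{13} - 0.065)^2 - 3.4^2\big\}^2 + \tfrac{1}{2}\big\{(x_{21} - 5.658)^2 + (x_{22} - 1.630)^2 + (x_{23} + 0.797)^2 - 3.4^2\big\}^2,$$

$$P^d(\varsigma_1,\varsigma_2) = -11.56\,\varsigma_1 - \tfrac{1}{2}\varsigma_1^2 - 11.56\,\varsigma_2 - \tfrac{1}{2}\varsigma_2^2.$$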
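The coefficient 11.56 in the dual can be traced back to the general form of Section 2. The following derivation is a sketch under stated assumptions: each term is identified with $W_i$ by taking $\alpha_i = 1$, $A_i = 2I_3$, $b_i = -2a_i$, $c_i = \|a_i\|^2 - d^2$ (with $a_i$ the anchor coordinates), $Q = 0$, $f = 0$, and the standard definitions $F(\varsigma) = f - \sum_i \varsigma_i b_i$ and $G(\varsigma) = Q + \sum_i \varsigma_i A_i$ are assumed. This identification is inferred from the formulas and is not spelled out in the text.

```latex
\begin{aligned}
W_i(x_i) &= \tfrac{1}{2}\bigl(\|x_i - a_i\|^2 - d^2\bigr)^2
          = \tfrac{1}{2}\bigl(\tfrac{1}{2}x_i^T(2I_3)x_i + (-2a_i)^T x_i + \|a_i\|^2 - d^2\bigr)^2, \\
G(\varsigma) &= \operatorname{diag}\bigl(2\varsigma_1 I_3,\; 2\varsigma_2 I_3\bigr), \qquad
F(\varsigma) = \bigl(2\varsigma_1 a_1,\; 2\varsigma_2 a_2\bigr), \\
\tfrac{1}{2}F^T G^{+} F &= \sum_{i=1}^{2} \frac{\|2\varsigma_i a_i\|^2}{2\cdot 2\varsigma_i}
          = \sum_{i=1}^{2} \varsigma_i \|a_i\|^2 \qquad (\varsigma_i \neq 0), \\
P^d(\varsigma) &= \sum_{i=1}^{2}\Bigl[\bigl(\|a_i\|^2 - d^2\bigr)\varsigma_i - \tfrac{1}{2}\varsigma_i^2 - \varsigma_i\|a_i\|^2\Bigr]
          = \sum_{i=1}^{2}\bigl(-d^2\varsigma_i - \tfrac{1}{2}\varsigma_i^2\bigr),
          \qquad d^2 = 3.4^2 = 11.56 .
\end{aligned}
```

Under these assumptions the anchor coordinates cancel from the unperturbed dual, which is why both terms carry the same coefficient 11.56.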

We can get a local maximal solution (-11.56, -11.56) for $P^d(\varsigma_1,\varsigma_2)$ and its corresponding local maximal solution to $P(x_1,x_2)$. But we need the global maximal solution of $P^d(\varsigma_1,\varsigma_2)$. Thus, by introducing the perturbation parameter $\epsilon = 0.05$, we seek the global optimal solutions of the perturbed problems of $P(x_1,x_2)$ and $P^d(\varsigma_1,\varsigma_2)$:

$$P_\epsilon(x_1,x_2) = \tfrac{1}{2}\big\{(x_{11} - 6.014)^2 + (x_{12} - 5.917)^2 + (x_{13} - 0.065)^2 - 3.4^2\big\}^2 + \tfrac{1}{2}\big\{(x_{21} - 5.658)^2 + (x_{22} - 1.630)^2 + (x_{23} + 0.797)^2 - 3.4^2\big\}^2 - 0.05x_{11} - 0.05x_{12} - 0.05x_{13} - 0.05x_{21} - 0.05x_{22} - 0.05x_{23},$$

$$P^d_\epsilon(\varsigma_1,\varsigma_2) = 59.6233\,\varsigma_1 - 0.5\,\varsigma_1^2 + 23.7451\,\varsigma_2 - 0.5\,\varsigma_2^2 - \frac{1}{2}\left(\frac{(0.05 + 12.028\,\varsigma_1)^2}{2\varsigma_1} + \frac{(0.05 + 11.834\,\varsigma_1)^2}{2\varsigma_1} + \frac{(0.05 + 0.130\,\varsigma_1)^2}{2\varsigma_1}\right) - \frac{1}{2}\left(\frac{(0.05 + 11.316\,\varsigma_2)^2}{2\varsigma_2} + \frac{(0.05 + 3.2600\,\varsigma_2)^2}{2\varsigma_2} + \frac{(0.05 - 1.594\,\varsigma_2)^2}{2\varsigma_2}\right).$$

Figure 3: Far vdw contacts of AG chains and BH chains of Model 1.

Figure 4: Far vdw contacts of AG chains and BH chains of Model 2.

We can easily get the global maximal solution $(0.0127287, 0.0127287) \in \{\varsigma \in \mathbb{R}^2 \mid \varsigma_i > 0,\ i = 1, 2\}$ for $P^d_\epsilon(\varsigma_1,\varsigma_2)$. Then we get the corresponding solution for $P_\epsilon(x_1,x_2)$:

$$\bar x = (7.97807,\ 7.88107,\ 2.02907,\ 7.62207,\ 3.59407,\ 1.16707).$$

By Theorem 1, $\bar x$ is a global minimal solution of $P_\epsilon(x_1,x_2)$. Setting $\bar x$ as the coordinates of G.ALA4.CB and H.ALA3.CB and taking the average value, we get

(15)
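The reported dual maximizer and sensor coordinates can be reproduced with a short calculation. This is a sketch under assumptions: expanding each anchor's part of the perturbed dual gives −d²ς − ½ς² − 3ε²/(4ς) plus a constant, so the stationarity condition on ς > 0 is −d² − ς + 3ε²/(4ς²) = 0, and the primal point is recovered as x = a + ε/(2ς). That reduction is a simplification worked out for this example rather than a formula quoted from the paper, and the function names are hypothetical.

```python
import numpy as np

D2 = 3.4 ** 2   # squared target distance: (twice the carbon vdw radius)^2
EPS = 0.05      # perturbation parameter used in the text

def dual_maximizer(d2=D2, eps=EPS, tol=1e-14):
    """Positive root of -d2 - s + 3 eps^2 / (4 s^2) = 0, found by Newton's method."""
    s = np.sqrt(3.0 * eps**2 / (4.0 * d2))   # root obtained when the -s term is dropped
    for _ in range(50):
        g = -d2 - s + 0.75 * eps**2 / s**2
        dg = -1.0 - 1.5 * eps**2 / s**3
        step = g / dg
        s -= step
        if abs(step) < tol:
            break
    return float(s)

def sensor_position(anchor, s, eps=EPS):
    """x = G^{-1} F = (eps + 2 s a) / (2 s) = a + eps / (2 s), componentwise."""
    return np.asarray(anchor, dtype=float) + eps / (2.0 * s)

anchors = [(6.014, 5.917, 0.065), (5.658, 1.630, -0.797)]
s_star = dual_maximizer()
sensors = [sensor_position(a, s_star) for a in anchors]
dists = [float(np.linalg.norm(x - np.asarray(a))) for x, a in zip(sensors, anchors)]
print(s_star, sensors, dists)
```

Both anchors give the same dual value because the stationarity condition depends only on d and ε, which matches the repeated entry in the reported dual solution.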



By (15) we get very close vdw contacts between the A chain and the G chain, and between the B chain and the H chain (Figs. 6-8). Thus, we constructed Models 1-3, and through further refinement by the Amber 11 package (Case et al., 2010) we finally obtain the optimal Models (Figs. 9-11). We find that the RMSD (root mean square deviation) between Figs. 6-8 and Figs. 9-11 is zero angstroms; this implies that the Amber 11 refinements are not necessary and the CDT is good enough to obtain the optimal Models 1-3 as illustrated in Figs. 6-8. The other CDIJ and EFKL chains can be obtained by translating the ABGH chains in parallel using formulas (13)-(14).

Figure 5: Far vdw contacts of AG chains and BH chains of Model 3.
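An RMSD comparison of two structures after superposition can be sketched as follows. This is an illustrative Kabsch-style calculation with NumPy, assuming both coordinate sets list the same atoms in the same order; it is not the VMD routine used in the paper, and the function name `kabsch_rmsd` is hypothetical.

```python
import numpy as np

def kabsch_rmsd(P, Q):
    """RMSD between two (n, 3) coordinate sets after optimal rigid superposition."""
    P = np.asarray(P, dtype=float)
    Q = np.asarray(Q, dtype=float)
    Pc = P - P.mean(axis=0)              # remove translation
    Qc = Q - Q.mean(axis=0)
    H = Pc.T @ Qc                        # 3x3 covariance matrix
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    D = np.diag([1.0, 1.0, d])           # guard against an improper rotation
    R = Vt.T @ D @ U.T                   # rotation that maps Pc onto Qc
    diff = (R @ Pc.T).T - Qc
    return float(np.sqrt(np.mean(np.sum(diff**2, axis=1))))

# Toy check: a rotated and shifted copy of a structure has RMSD close to zero.
P = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]])
theta = np.pi / 6.0
Rz = np.array([[np.cos(theta), -np.sin(theta), 0.0],
               [np.sin(theta),  np.cos(theta), 0.0],
               [0.0, 0.0, 1.0]])
Q = P @ Rz.T + np.array([1.0, -2.0, 0.5])
print(kabsch_rmsd(P, Q))
```

A value of zero therefore means the two snapshots differ at most by a rigid rotation and translation.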

To end this Section, we give some remarks on Models 1-3. (1) The canonical dual approach makes the distance between the closest CB atoms of Chain A and Chain G, and of Chain B and Chain H, equal to twice the vdw radius of the CB carbon atom (Figs. 6-8), and this is the perfect structure of Models 1-3. Figs. 9-11 were obtained by further refinement through the SDCG optimization methods of the Amber 11 package. The zero RMSD value between Figs. 6-8 and Figs. 9-11 implies that the canonical dual approach of this paper works well. (2) The SDCG optimization methods of the Amber 11 package automatically take the bond angles and dihedral angles into account, and during the canonical dual molecular model building and optimization procedure, the bond angles and dihedral angles automatically produced by the Swiss-PdbViewer package are still kept. (3) The molecular modeling problem of this paper is in fact a very simple two-body problem of theoretical physics, i.e. Einstein's absolute relative theory. In mathematics, it is a sensor network problem with two anchors and two sensors.

Figure 6: Close vdw contacts of AG chains and BH chains of Model 1.

Figure 7: Close vdw contacts of AG chains and BH chains of Model 2.

4 Conclusion 

This paper presents an important method and provides useful information for treatments of prion diseases. X-ray crystallography is a powerful tool for determining protein 3D structure. However, it is time-consuming and expensive, and not all proteins can be successfully crystallized, particularly membrane proteins. Although NMR spectroscopy is indeed a very powerful tool for determining the 3D structures of membrane proteins, it is also time-consuming and costly. Due to the noncrystalline and insoluble nature of the neurodegenerative amyloid fibril or plaque, little structural data on the prion AGAAAAGA segment is available. Under these circumstances, the novel canonical dual computational approach introduced in this paper showed its power in the molecular modeling of prion AGAAAAGA amyloid fibrils. This indicates that computational approaches, or the introduction of novel mathematical formulations and physical concepts into molecular biology, can significantly stimulate the development of biological and medical science. The optimal atomic-resolution structures of prion AGAAAAGA amyloid fibrils presented in this paper are useful for the drive to find treatments for prion diseases in the field of medicinal chemistry.

Figure 8: Close vdw contacts of AG chains and BH chains of Model 3.

Figure 9: Optimal structure of prion AGAAAAGA amyloid fibril Model 1.